Reflexive Regular Equivalence for Bipartite Data
نویسندگان
چکیده
Bipartite data is common in data engineering and brings unique challenges, particularly when it comes to clustering tasks that impose strong structural assumptions. This work presents an unsupervised method for assessing similarity in bipartite data. The method is based on regular equivalence in graphs and uses spectral properties of a bipartite adjacency matrix to estimate similarity in both dimensions. The method is reflexive in that similarity in one dimension informs similarity in the other. The method also uses local graph transitivities, a contribution governed by its only free parameter. Reflexive regular equivalence can be used to validate assumptions of co-similarity, which are required but often untested in co-clustering analyses. The method is robust to noise and asymmetric data, making it particularly suited for cluster analysis and recommendation in data of unknown structure. In bipartite data, co-similarity is the notion that similarity in one dimension is matched by similarity in some other dimension. Such data occurs in a many areas: text mining, gene expression networks, consumer co-purchasing data and social affiliation. In bipartite analyses, co-clustering is an increasingly prominent technique in a range of applications [11], but has strong co-similarity assumptions. One example of co-similar structure is in text analysis where similar words appear in similar documents, where there is assumed to be a permutation of the word-document co-occurrence matrix that exposes co-similarity among words and documents [4]. The work here describes a way to assess co-similarity using regular equivalence [8] with a reflexive conception of similarity that accommodates nodes’ (data-points) local structures. This assessment is a kind of pre-condition for co-clustering: if there is little co-similarity, co-clustering will yield a poor clustering solution. Assessing co-similarity will produce similarity in one dimension to expose potential clustering without requiring it across dimensions. This is particularly useful for asymmetric data when a non-clustered dimension informs, but does not reciprocate clustering in the other. Whereas as co-clustering finds clusters across dimensions, our method provides a decoupled solution in each mode. Reflexive regular equivalence is able to quantify how much one dimension informs similarity in the other. Additionally, our results show that by incorporating local structures, it can better overcome noise and accommodate asymmetry. 4 An extended preprint of this paper is available at arxiv.org/abs/1702.04956.
منابع مشابه
ON THE COMPATIBILITY OF A CRISP RELATION WITH A FUZZY EQUIVALENCE RELATION
In a recent paper, De Baets et al. have characterized the fuzzytolerance and fuzzy equivalence relations that a given strict order relation iscompatible with. In this paper, we generalize this characterization by consideringan arbitrary (crisp) relation instead of a strict order relation, while payingattention to the particular cases of a reflexive or irreflexive relation. The reasoninglargely ...
متن کاملHamiltonian decompositions of random bipartite regular graphs
We prove a complete hamiltonian decomposition theorem for random bipartite regular graphs, thereby verifying a conjecture of Robinson and Wormald. The main step is to prove contiguity (a kind of asymptotic equivalence) of two probabilistic models of 4-regular bipartite graphs; namely, the uniform model, and the model obtained by taking the union of two independent, uniformly chosen bipartite Ha...
متن کاملFuzzy number-valued fuzzy relation
It is well known fact that binary relations are generalized mathematical functions. Contrary to functions from domain to range, binary relations may assign to each element of domain two or more elements of range. Some basic operations on functions such as the inverse and composition are applicable to binary relations as well. Depending on the domain or range or both are fuzzy value fuzzy set, i...
متن کاملSignature submanifolds for some equivalence problems
This article concerned on the study of signature submanifolds for curves under Lie group actions SE(2), SA(2) and for surfaces under SE(3). Signature submanifold is a regular submanifold which its coordinate components are differential invariants of an associated manifold under Lie group action, and therefore signature submanifold is a key for solving equivalence problems.
متن کاملCodes in Bipartite Distance^regular Graphs
For each bipartition_of a bipartite distance-regular graph F, there naturally corresponds another distance-regular graph F called a halved graph. It is shown that the existence of a perfect e-code in a halved graph F is equivalent to the existence of a uniformly packed 2e-code in F with certain specific parameters. Using this equivalence, we show the non-existence of perfect codes for two class...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017